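Speaker localization using microphone arrays depends on accurate time delay estimation techniques. For decades, methods based on the generalized cross-correlation with phase transform (GCC-PHAT) have been widely used for this purpose. Recently, GCC-PHAT has also been used to provide input features to neural networks in order to remove the effects of noise and reverberation, but at the cost of losing theoretical guarantees in noise-free conditions. We propose a novel approach to extending GCC-PHAT, in which the received signals are filtered using a shift-equivariant neural network that preserves the timing information contained in the signals. Through extensive experiments, we show that our model consistently reduces the error of GCC-PHAT in adverse environments, while guaranteeing exact time delay recovery in ideal conditions.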
translated by Google Translate
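As a rough illustration of the classical GCC-PHAT baseline named in the abstract above (not the proposed neural extension), the sketch below estimates an integer sample delay between two signals. The function names `dft` and `gcc_phat`, the naive O(n^2) transform, and the zero-padding choice are assumptions made to keep the example dependency-free; a practical implementation would use an FFT library.

```python
import cmath


def dft(x, inverse=False):
    """Naive discrete Fourier transform (O(n^2)); adequate for short signals."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n) for t in range(n))
        out.append(s / n if inverse else s)
    return out


def gcc_phat(sig, ref, eps=1e-12):
    """Return the delay (in samples) of `sig` relative to `ref`.

    A positive result means `sig` lags `ref`.
    """
    n = len(sig) + len(ref)  # zero-pad so the circular correlation does not wrap
    S = dft(list(sig) + [0.0] * (n - len(sig)))
    R = dft(list(ref) + [0.0] * (n - len(ref)))
    # Cross-power spectrum, then the phase transform: keep phase, drop magnitude.
    cross = [s * r.conjugate() for s, r in zip(S, R)]
    phat = [c / (abs(c) + eps) for c in cross]
    cc = [v.real for v in dft(phat, inverse=True)]
    best = max(range(n), key=lambda i: cc[i])
    # Indices in the upper half of the buffer correspond to negative lags.
    return best if best < n // 2 else best - n
```

Calling `gcc_phat(sig, ref)` with `sig` equal to `ref` shifted right by a few samples should return that shift. In noisy or reverberant conditions the peak can move, which is the failure mode the abstract addresses.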
Participants in political discourse employ rhetorical strategies -- such as hedging, attributions, or denials -- to display varying degrees of belief commitments to claims proposed by themselves or others. Traditionally, political scientists have studied these epistemic phenomena through labor-intensive manual content analysis. We propose to help automate such work through epistemic stance prediction, drawn from research in computational semantics, to distinguish at the clausal level what is asserted, denied, or only ambivalently suggested by the author or other mentioned entities (belief holders). We first develop a simple RoBERTa-based model for multi-source stance predictions that outperforms more complex state-of-the-art modeling. Then we demonstrate its novel application to political science by conducting a large-scale analysis of the Mass Market Manifestos corpus of U.S. political opinion books, where we characterize trends in cited belief holders -- respected allies and opposed bogeymen -- across U.S. political ideologies.
We describe PromptBoosting, a query-efficient procedure for building a text classifier from a neural language model (LM) without access to the LM's parameters, gradients, or hidden representations. This form of "black-box" classifier training has become increasingly important as the cost of training and inference in large-scale LMs grows. But existing black-box LM classifier learning approaches are themselves computationally inefficient, typically specializing LMs to the target task by searching in a large space of (discrete or continuous) prompts using zeroth-order optimization methods. Instead of directly optimizing in prompt space, PromptBoosting obtains a small pool of prompts via a gradient-free approach and then constructs a large pool of weak learners by pairing these prompts with different elements of the LM's output distribution. These weak learners are then ensembled using the AdaBoost algorithm. The entire learning process requires only a small number of forward passes and no backward pass. Experiments show that PromptBoosting achieves state-of-the-art performance in multiple black-box few-shot classification tasks, and matches or outperforms full fine-tuning in both few-shot and standard learning paradigms, while training 10x faster than existing black-box methods.
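The ensembling step described above can be sketched with a plain binary AdaBoost over a fixed pool of weak learners. This is a simplified illustration, not the PromptBoosting implementation: the pool here is a hypothetical list of precomputed predictions in {-1, +1} (standing in for prompt and output-token pairings), and the names `adaboost` and `predict` are assumptions.

```python
import math


def adaboost(pool, labels, rounds=3, eps=1e-10):
    """Select and weight weak learners from a fixed pool.

    pool[j][i] is learner j's prediction (-1 or +1) on training example i.
    Returns a list of (alpha, learner_index) pairs.
    """
    n = len(labels)
    w = [1.0 / n] * n  # uniform example weights to start
    ensemble = []
    for _ in range(rounds):
        # Weighted error of every learner under the current example weights.
        errs = [sum(wi for wi, p, y in zip(w, preds, labels) if p != y)
                for preds in pool]
        j = min(range(len(pool)), key=errs.__getitem__)
        err = min(max(errs[j], eps), 1 - eps)
        if err >= 0.5:  # no learner is better than chance; stop early
            break
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, j))
        # Up-weight the examples the chosen learner got wrong, then renormalize.
        w = [wi * math.exp(-alpha * y * p) for wi, p, y in zip(w, pool[j], labels)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble


def predict(ensemble, pool_preds):
    """pool_preds[j] is learner j's prediction (-1 or +1) for one example."""
    score = sum(alpha * pool_preds[j] for alpha, j in ensemble)
    return 1 if score >= 0 else -1
```

Because the weak learners' predictions are computed once and reused across boosting rounds, this style of training needs only forward passes through the underlying model, which is consistent with the query-efficiency claim in the abstract.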
We test grip strength and shock absorption properties of various granular materials in granular jamming robotic components. The granular materials comprise a range of natural, manufactured, and 3D printed materials encompassing a wide range of shapes, sizes, and Shore hardness. Two main experiments are considered, both representing compelling use cases for granular jamming in soft robotics. The first experiment measures grip strength (retention force measured in newtons) when we fill a latex balloon with the chosen grain type and use it as a granular jamming gripper to pick up a range of test objects. The second experiment measures shock absorption properties recorded by an Inertial Measurement Unit which is suspended in an envelope of granular material and dropped from a set height. Our results highlight a range of shape, size and softness effects, including that grain deformability is a key determinant of grip strength, and interestingly, that larger grain sizes in 3D printed grains create better shock absorbing materials.
Measuring growth rates of apple fruitlets is important because it allows apple growers to determine when to apply chemical thinners to their crops to optimize yield. The current practice of obtaining growth rates involves using calipers to record sizes of fruitlets across multiple days. Due to the number of fruitlets needed to be sized, this method is laborious, time-consuming, and prone to human error. In this paper, we present a computer vision approach to measure the sizes and growth rates of apple fruitlets. With images collected by a hand-held stereo camera, our system detects, segments, and fits ellipses to fruitlets to measure their diameters. To measure growth rates, we utilize an Attentional Graph Neural Network to associate fruitlets across different days. We provide quantitative results on data collected in an apple orchard, and demonstrate that our system is able to predict abscise rates within 3% of the current method with a 7 times improvement in speed, while requiring significantly less manual effort. Moreover, we provide results on images captured by a robotic system in the field, and discuss the next steps to make the process fully autonomous.
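Methods that are resilient to artifacts in cardiac magnetic resonance imaging (MRI) while performing ventricle segmentation are crucial for ensuring the quality of structural and functional analysis of these tissues. Although significant effort has gone into improving the quality of the algorithms, few works have tackled the harm that artifacts cause in the predictions. In this work, we study fine-tuning of pretrained networks to improve the resilience of previous methods to these artifacts. In our proposed method, we make extensive use of data augmentations that mimic those artifacts. The results significantly improved the baseline segmentations (up to 0.06 Dice score and 4 mm Hausdorff distance improvement).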
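For readers unfamiliar with the two metrics quoted above, here is a minimal sketch of how the Dice score and the symmetric Hausdorff distance are commonly defined. It assumes flat binary masks for Dice and small lists of boundary-point coordinates for the Hausdorff distance; the function names are illustrative, and the abstract does not say which exact variants (for example, percentile-based Hausdorff) were used.

```python
import math


def dice_score(pred, target):
    """Dice = 2|A ∩ B| / (|A| + |B|) for binary masks given as flat 0/1 sequences."""
    inter = sum(1 for p, t in zip(pred, target) if p and t)
    size = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if size == 0 else 2.0 * inter / size


def hausdorff_distance(a, b):
    """Symmetric Hausdorff distance between two non-empty point sets."""
    def directed(p, q):
        # Largest distance from a point in p to its nearest point in q.
        return max(min(math.dist(x, y) for y in q) for x in p)
    return max(directed(a, b), directed(b, a))
```

A higher Dice score and a lower Hausdorff distance both indicate closer agreement with the reference segmentation, so the reported improvements point in the same direction.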
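The importance of robotic manipulation and control has increased in recent years. However, state-of-the-art techniques still have limitations when manipulation is required in real-world applications. This paper explores Hindsight Experience Replay in both simulated and real environments, highlights its weaknesses, and proposes reinforcement-learning-based alternatives built on reward and goal shaping. In addition, several research questions are identified, along with potential research directions that could be explored to address them.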
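As background for the technique named above, the core idea of Hindsight Experience Replay can be sketched as goal relabeling: a failed episode is stored again as if the state it actually reached had been the goal. The sketch below uses the "final" relabeling strategy with a sparse reward; the transition layout (dictionaries with `achieved` and `goal` keys) and the function names are assumptions for illustration, not details taken from the paper.

```python
def sparse_reward(achieved, goal):
    """Sparse goal-conditioned reward: 0 on success, -1 otherwise."""
    return 0.0 if achieved == goal else -1.0


def her_relabel(episode):
    """Relabel an episode with the goal it actually achieved at the end.

    Each transition is a dict with at least 'achieved' (the goal reached
    after the step) and 'goal' (the goal originally pursued).
    Returns new transitions; the input episode is left unchanged.
    """
    final_goal = episode[-1]["achieved"]
    relabeled = []
    for tr in episode:
        new = dict(tr, goal=final_goal)
        new["reward"] = sparse_reward(tr["achieved"], final_goal)
        relabeled.append(new)
    return relabeled
```

Both the original and the relabeled transitions would typically be added to the replay buffer, so that the agent receives some successful examples even when the original goal is never reached.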
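The study of language variation examines how language varies between and within different groups of speakers, shedding light on how we use language to construct identities and how social contexts affect language use. A common method is to identify instances of a certain linguistic feature (for example, the zero copula construction) in a corpus and analyze the feature's distribution across speakers, topics, and other variables, either to understand the feature's function or to systematically measure variation. In this paper, we explore the challenging task of automatic morphosyntactic feature detection in low-resource English varieties. We present a human-in-the-loop approach to generating and filtering effective contrast sets via corpus-guided edits. We show that our approach improves feature detection for both Indian English and African American English, demonstrate how it can assist linguistic research, and release our fine-tuned models for use by other researchers.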
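Imaging demonstrates that preclinical and human tumors are heterogeneous, i.e. a single tumor can exhibit multiple regions that behave differently both during normal development and in response to treatment. The large variations observed in control group tumors can obscure the detection of significant therapeutic effects because of ambiguity in attributing the causes of change. This can hinder the development of effective therapies due to limitations in experimental design rather than due to therapeutic failure. An improved method for modeling biological variation and heterogeneity in imaging signals is described. Specifically, Linear Poisson Modelling (LPM) is used to evaluate changes in the apparent diffusion coefficient (ADC) before and 72 hours after radiotherapy in two xenograft models of colorectal cancer. The statistical significance of the measured changes is compared with that attainable using a conventional t-test analysis of basic ADC distribution parameters. When applied to treated tumors, LPM detected highly significant changes. The analyses were significant for all tumors, equating to a 4-fold gain (i.e. equivalent to a sample size 16 times larger) compared with the conventional approach. In contrast, highly significant changes are detected only at the cohort level using t-tests, which restricts their potential use in personalized medicine and increases the number of animals required during testing. Furthermore, LPM enabled the relative volumes of responding and non-responding tissue to be estimated for each xenograft model. Leave-one-out analysis of the treated xenografts provided quality control and identified potential outliers, raising confidence in LPM data at clinically relevant sample sizes.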
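Although computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision obtain performance on downstream tasks comparable to that of their supervised counterparts in computer vision. However, self-supervised models have drawbacks, including their large numbers of parameters, computationally expensive training strategies, and a clear need for faster inference on downstream tasks. In this work, our goal is to study how a standard channel selection method developed for supervised learning can be applied to networks trained with self-supervision. We validate our findings on a range of target budgets $t_{d}$ for channel computation on image classification tasks across different datasets, specifically CIFAR-10, CIFAR-100, and ImageNet-100, obtaining performance comparable to that of the original network when selecting all channels, but at a significant reduction in computation reported in terms of FLOPs.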